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(3JT), Abstract. Dense sub-graphs of sparse graphs (communities) , which appear in most real-world 

[""t ■ complex networks, play an important role in many contexts. Most existing community detection 

' algorithms produce a hierarchical structure of community and seek a partition into communities 

that optimizes a given quality function. We propose new methods to improve the results of any 
of these algorithms. First we show how to optimize a general class of additive quality functions 
(containing the modularity, the performance, and a new similarity based quality function we 
■ propose) over a larger set of partitions than the classical methods. Moreover, we define new multi- 

scale quality functions which make it possible to detect the different scales at which meaningful 
community structures appear, while classical approaches find only one partition. 
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1 Introduction 



> . 

Recent advances have emphasized the importance of complex networks in many different do- 
mains such as sociology (acquaintance networks, collaboration networks), biology (metabolic 
qq ■ networks, gene networks) or computer science (Internet topology, web graph, p2p networks, 

<^ ', e-mail exchanges). We refer the reader to |2|19|1I12I5| for reviews from different perspectives 

and for an extensive bibliography. 

The analysis of these networks has brought out important and challenging graph algorithm 
q , problems. One of them is community detection, used to uncover structure in large networks: 

the corresponding graphs are generally globally sparse but locally dense; there exist groups 
of vertices, called communities, with many links between them but with few links to other 
vertices. Formally, we consider an undirected graph G = (V, E) with n = \V\ vertices, m = \E\ 
edges. The aim of a community detection algorithm is to find a partition V = {C±, . . . ,Ck} 
of the vertices (Cj D Cj = for i ^ j and Uj Cj = V) that maximizes a given quality function 
Q{V) (see Section EJ. 

Various approaches exist; they belong to a few main methodological categories which 
we succinctly overview here. First, the divisive approach starts from the entire graph and 
successively splits it into more and more communities. Some algorithms achieve this by re- 
moving inter-communities edges (the communities are the remaining connected components) 
according to their betweenness |13l7j or their local clustering [Ej. Others use recursive bi- 
section mechanisms based on minimum cuts |§] or spectral methods JJ. Another family of 
approaches, the agglomerative one, starts from n single-vertex communities and merges them 
successively into larger and larger communities. Some algorithms use hierarchical clustering 
methods according to different similarity measurements based on spectral properties [3] or 
random walks |15|14|2l] . Other algorithms are based on greedy optimization of a quality func- 
tion [3]. Finally, direct approaches try to perform global optimization of a quality function 
|(il8| . or iteratively modify the weight of the edges to make clusters appear [20], have also 
been used. 



Most community detection algorithms induce series of partitions Vo, ■ ■ ■ , V c corresponding 
to the successive steps of the algorithm: Vk+i = ~Pk\{Ck} U ■ ■ ■ > Cj} with Cj. = U] =1 C t - and 
Vq = V . If one considers a divisive algorithm the partitions Vk are obtained in increasing 
order of the steps k, and in decreasing order if one considers an agglomerative algorithm. 

One then define the dendrogram associated to the running of the algorithm as the tree 
in which C[, . . . ,Cj are the children of Ck, with the above notation, for all steps k. Classical 
community detection algorithms output the partition that maximizes a given quality function 
Q among Vq, . . . ,V C - 

We consider in this paper the situation where the dendrogram resulting of a running 
of a community detection algorithm on G is given. There are at most n steps as described 
above (c < n), which produces a set S of c + n subset of V: c subsets Ck corresponding to 
the steps of the algorithm plus n single-vertex sets. Many possible partitions are induced by 
these subsets; we denote 77 the set of all these possible partitions: 77 = {V\\/C G V,C G 
S and UceP C = V and VCi / C2 G V, C\ n C2 = 0}. Intuitively these partitions are given by 
horizontal (but not necessarily straight) cuts of the associated dendrogram (Figure^)). We 
also define in the same manner the sets lie of all the possible partitions of a community C 
in S. The reader must keep in mind that, throughout this paper, we will never consider any 
partition (or sub-partition) containing a community that is not in S. 

Contribution 

We introduce in this paper new post-processing methods to improve the results of any algo- 
rithm that finds hierarchical community structures (encoded by the dendrogram) . We address 
the two following limitations of previous contributions. 

First, we note that considering all the possible partitions in 77 (instead of only the c + 1 
partitions Vo, ■ ■ ■ ,V C ) will necessarily produce better results than the classical method, and 
cannot be worst. The number of valid partitions being exponential in general, it is impossible 
to find efficiently the partition that maximizes an arbitrary quality function. However, we 
will show in Section |2] that this is possible with some reasonable assumptions on the quality 
function Q(V). These results are obtained for a general class of additive quality functions that 
contains the modularity \1'6\ , the performance [I| and a new similarity-based quality function 
introduced in Section |2I 

Second, we propose in Section|21 multi-scale quality functions in order to detect community 
structures at different scales and to determine the most relevant scales at which the graph 
should be observed. We will finally evaluate the benefits of these new approaches with some 
experiments (Section^). 

2 Improving the partition into communities 

In this section, we first introduce a general class of additive quality functions. We show that 
such functions can be efficiently optimized 1 over all the possible partitions V G 77 encoded in 
a dendrogram. 

Definition 1. A quality function Q is additive if there exists an elementary function q, such 
that for any partition V: 

Q(V) = £ q(C) 
cev 

1 Without loss of generality we can consider that the function must be maximized. 
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Let us first show that this definition is not too restrictive by considering three special cases 
of interest. 



The modularity introduced in [12], has already been widely used |3I4|6|7I8|13) . It relies on 
the fraction of edges e(C) inside each community C and the fraction of edges 2 a(C) bound to 
community C: 

Q M (V) = J2<C)-a(C) 2 

cev 

This definition directly induces that the modularity is additive, using q M (C) = e(C) — a(C) 2 . 
We may also notice that it satisfies —1 < Q M (V) < 1, and that each evaluation of the function 
can be done in 0(m). 

The performance j2j counts the number of correctly interpreted pairs of vertices (either two 
vertices belonging to the same community and connected by an edge, or two vertices belonging 
to different communities and not connected by an edge). For a given partition V, we denote 
by C(u) the community containing vertex u in the following definition: 

qP = \{{u,v} £ E,C(u)=C(v)}\ + \{{u,v} j E,C(u) ± C{v)}\ 

\n{n — 1) 

The function Q p is the ratio of correctly identified pairs of links, and so < Q P (V) < 1. Its 
additivity is proved using q p (C) = n(w 1 _ 1) YlueC \{ v G c A u i v ) G E }\ + \{ v $ C A U , V } $ E }\- 
This quality function can be computed in 0(m) and may be generalized to weighted graph 
as discussed in 0. 

A similarity based quality function. This approach supposes that we have a distance dij > 
measuring the similarity between any pair of vertices i and j (the smaller du is, the more simi- 
lar i and j are) . We want to find homogeneous communities by minimizing their heterogeneity 
quantified by the mean square sum of the distances <r(C) = t^t Yli jec ^ij- 

However, minimizing these quantities leads to the partition with n single- vertex com- 
munity. We will counterbalance this by minimizing at the same time the number c(V) of 
communities in the partition. The maximal values of these quantities (namely o~(C) < o~ max 
obtained for C = V, and c{V) < n) are used in the following definition: 

This quality function satisfies —2 < Q S (V) < 0. We prove that it is additive using q s (C) = 
~n ~ • Each evaluation of o~(C) requires 0(|C| 2 ) distance computations for an arbitrary 
distance. However if d is an Euclidean distance then a{C\UC2) can be obtained from o~(Ci) and 
o-(C<z) with only one additional distance computation. Therefore all the o~(C) can be obtained 
with n distance computations in this case. Such a distance, based on random walks, was 
proposed in [15114] together with an agglomerative community detection algorithm which 
computes the values of cr(C). Thus this quality function can be used within this algorithm at 
no additional cost. 

The examples above show that the class of additive quality functions is quite general, and 
that many previously used quality functions actually fit in this class. We will now show that 

2 inter-community edges contribute for | to each community. 
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it is possible to maximize any additive quality function over the set of partitions 77 with a 
simple recursive approach. 

Lemma 1. Given an additive quality function Q and a dendrogram in which the set C has 
children C\, ...,Cfc, the partition V max £ 77c that maximizes Q is either {C} or V\ U ... U Vk 
where V% S 77^ maximizes Q over 11^ and Q(V max ) = max Q(V) = max(g(C),\ Q(Pi))- 

Proof. Suppose that the partition V max 6 77c maximizing Q is not {C}. Then it induces a 
sub-partition Vi E 77^ m each of its child Cj such that V ma x = ^{P%- Now suppose there exists 
V[ £ n Cl such that Q{V-) > Q(Vi) then the sub-partition V' max = V\ U ... U V- U . . . U Vk will 
satisfy, thanks to the additivity, Q(V max ) > QiVmax), which is impossible. 

Theorem 1. Given an additive quality function Q and a dendrogram, it is possible to find 
the partition V £ 71 that maximizes Q in 0(n) plus 0(n) evaluations of the function q. 

Proof. Lemma H guarantees that the recursive function FindBestPartition finds the partition 
maximizing Q over 77 when called on the largest set of vertices V. The function is called 
only once on each node of the dendrogram, thus the total number of calls (and thus the total 
number of evaluations of the elementary quality function q) is | *S* | < In. 



Function FindBestPartition (C) 
foreach child C, of C do 

(Qii'Pi) <— FindBestPartition (Cj) 
end 

if C has no child or q(C) > Qi then 

return q(C),{C} 
else 

return YjjQi^(Pj 

Let us note moreover that some quality functions allow optimizations concerning the 
computation of the q(C): for example it is possible to compute efficiently q(C\ U C2) from the 
values of q{C\) and (/(C2) for the modularity [H] and for random walk quality function |15|14j . 

3 Multi-scale community structure detection 

Even if most community detection algorithms find hierarchical structures of community, they 
generally ouptput only one partition (like in Section EJ. However, communities often appear 
at different scales in complex networks. To overcome this limitation, we will propose here 
multi-scale quality functions which work at different scales. We will then propose a method 
to determine the most relevant scales, highlighting meaningful communities. 

3.1 Multi-scale quality functions 

We will consider in all this section a scale factor < a < 1 going from microscopic to 
macroscopic scales: a = corresponds to smallest communities with only one vertex and 
a = 1 corresponds to the largest community containing all the vertices. We will define multi- 
scale quality functions Q a and the partitions V a maximizing them should be consistent with 
the scale factor, which is captured by the following definition: 
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Fig. 1. (a) Example graph with a multi-scale community structure, (b) Hierarchical commu- 
nity structure (dendrogram) found by the Walktrap algorithm |15ll4j : the heights of the nodes 
represent the steps of the algorithm. The classical approach only considers partitions given by 
straight horizontal cuts on this dendrogram: here a 5-communities partition maximizes the 
modularity Q M = 0.55. (c) Reordered dendrogram according the multi-scale quality func- 
tion Qa ■ Horizontal cuts show the best partition V a for any scale factor a. The maximal 
modularity Q M = 0.57 (obtained for a = |) improves the classical approach. The relevance 
function R(a) indicates two meaningful scale factors (a = 0.42 and a = 0.73) corresponding 
to a 6-communities partition and a 3-communities partition. 



Definition 2. Consider a family of quality functions (Q a )o<a<i, an d denote hyV a the 'par- 
tition in LI Tnaxiuiizifig Qa • Then, Qa 

is a multi-scale quality function if 

eti < "2 =>• V ai -< Tol 2 with V a= o = {{v}\v G V} and V a =i = {V} 

where V ai ^ V a2 iff the sets ofV ai are included in those ofV a2 : VCi G V ai , 3C 2 G V a2 \Ci — ^2 

Note that multi-scale quality functions are quality- functions, and so the notion of additivity 
(Definition^) applies. We propose now a general class of additive multi-scale quality functions. 

Theorem 2. Let us consider a function h over the parts of V such that h(C\ U C 2 ) > h(C\) + 
h(C2): h is higher in macroscopic scales. Likewise, let us consider I such that l(C\ U C 2 ) < 
1{C\) + l(C2): I is higher in microscopic scales. Then functions Q a defined by: 

Q a (V) = ^(C) with q a (C) = ah(C) + (1 - a)l{C) 
are additive multi-scale quality functions. 

Proof. Suppose that ct\ < 02 but V ai ^ V a2 . Then there exist C G V ai and Ci,...,Ck G 
V a2 such that C = C\ U . . . U C&. We have: q ai (C) = oi\h(C) + (1 — ai)l[C) = q a2 (C) + 
(ai — a 2 )/i(C) + (a 2 — ot\)l{C). But V a2 (containing C\,...,Ck) maximizes Q a2 , therefore: 
Qa 2 {C) < q a2 {C\) + . . . + q a2 (Ck). Moreover h and I satisfy: h(C) > h{C\) + . . . + /i(Cfc) and 
1(C) < l(C\) + . . . + l(Ck). Finally with the fact that oi\ < a 2 we obtain: q ai (C) < q a2 (Ci) + 
• • • + Qa 2 (Ck) + («i —oii)(h(C{) + . . . + h(Ck)) + {0,2 — ol\){1{Cx) + . . . + l(C k )). And we recognize 
the inequality q ai (C) < q ai (C±) + . . . + q ai (Ck) which is in contradiction with the fact that 
Vai maximizes Q ai . This prove the main property of Definition [5J 

Then the additivity is immediate and it is simple to check that V a= Q = {{v}\v G V} and 
V a =\ = {^} thanks to the inequalities satisfied by h and I. 

This theorem makes it possible to create an additive multi-scale quality function from 
two elementary functions. These two functions must have opposite growing behavior with 
community sizes and they also have to capture expected properties of communities. We now 
propose suitable multi-scale quality functions which generalize those of Section |2] (the original 
quality functions are obtained back as a particular case for a = ^). 

The multi-scale modularity. We generalize the modularity by introducing the scale factor a 
in its definition: 

Q A J(V) = Y / «e(C)-(l-a)a(C) 2 

cev 

We check that the properties of Theorem [21 are satisfied to assures that Q^f is an addi- 
tive multi-scale quality function. We consider h (C) = e(C) the fraction of internal edges of 
community C and l M '(C) = — a(C) 2 using the fraction of edges bound to community C. We have 
/i M (CiUC 2 ) > h M \Ci)+h M (C 2 ) because e(CiUC 2 ) = e(d)+e(C 2 )+(fraction of edges between d and 
And Z M (Ci U C 2 ) < Z A/ (Ci) + l M {C 2 ) because o(Ci U C 2 ) = a{d) + a(C 2 ) and thus l M \C X ) + 
l M (C 2 ) - l M {C x U C 2 ) = 2a(Ci)a(C 2 ). 
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The multi-scale performance. It is defined in the same manner by: 



P _ a\{{u,v} G E,C(u) = C(v)}\ + (1 - a)\{{u,v} j E,C(u) f C(v)}\ 
Q ° [l) - §n(n-l) 

We use h p (C) = ^ £ ueC |{« € C, {«, u} G £}| and /^(C) = ^ £ ueC IW C, {u, v} £ 
E}\. The two inequalities required by Theorem [21 are easily verified if we remark that h p (C) 
counts the number of edges inside C and l p (C) counts the number non existing edges between 
vertices of C and other vertices. 



A multi-scale similarity based quality function. Using the same idea we can generalize the third 
quality function based on similarity measurement dij between vertices. However, the quantity 
o~(C) measuring community homogeneity must also satisfy a{C\ U C2) > cr(Ci) + afa), which 
is the case for Euclidean distances. 

Q s a (V) = -ac(V) - (1 - a) £ ^ 

h s (C) = —y t trivially satisfies the inequality of Theorem [2j The other inequality satisfied by 
I s (C) = — „ comes from the restriction on d discussed above. 



3.2 Finding the best partition for every scale 

A multi-scale quality function Q a allows us to find a partition V a for any scale factor < 
a < 1. We will show in this section how to compute efficiently all these partitions for the 
general class of multi-scale quality functions defined in Theorem |21 The order between the 
V a (Definition El indicates that ct\ < a<i =>• V ai ^ V a2 ) implies that the total number of 
different partitions V a is at most n. Indeed, each partition is obtained from the previous one 
by splitting at least one community. Therefore the number of communities of the partition 
is at least k. The number of communities of each partition being less than n (the number of 
vertices) we cannot have more than n different partitions V a . 

To determine all the partitions V a , we only need to determine the list of the particular 
scale factors a« at which V a changes (split of a community into sub-communities). The corre- 
sponding modifications induce a new hierarchy into the community structure: the community 
splits can be ordered by the scale factors a« at which they occur. The dendrogram can be 
reordered with this new hierarchy as illustrated in Figure This provides more accurate 
information on the community scales and improves the comparison between them. 

For each partition V, the function Q a (V) = 1{V) + (h(V) — l(V))a can be seen as an 
affine function of the parameter a. Therefore, finding all the best partition V a is equivalent 
to finding the function Qmax( a ) = Qa^Pa) defined by: 

Definition 3. We define the piecewise affine function Qmax( a ) ^at maximizes Q a (V) over 
all the possible partitions V G lie: 

Qmax( a ) = max Q a (V) 

Theorem 3. Given an additive multi-scale quality function Q a satisfying Theorem 03 and 
a dendrogram, it is possible to compute Qmax( a ) by making at most 0(n) evaluations of the 
elementary quality function q a . The additional average complexity is 0(ny/n) for an arbitrary 
dendrogram, it is C(nlog(n)) for balanced one and the worst case is 0(n 2 ). 
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Proof. For a fixed a, and for a community C having children C%, . . . ,Ck in the dendrogram, 

Lemma ^ indicates that maxQ a (V) = max(g Q (C),^ max QaiJ^^))- This equality holds for 

veiJc v&nc i 

any a and thus we deduce Qmax( a ) = max (?a(C), Yli Qrnax(ct))- This prove the corectness of 
the recursive function FindMultiscalePartitions that computes Q^ax by manipulating piece- 
wise affine functions. Qm ax (a) is obtained for the parameter C = V. 

The function is recursively called exactly once on each node of the dendrogram, leading 
to 0(n) evaluations of the elementary quality function q a . Each call also evaluates a sum and 
a maximum operation on piecewise affine functions encoded by the list of their particular 
points (aj, QmeL((Xi))- These operations are done in time linear in the size of the input piece- 
wise functions, the sum of their sizes is at most \C\. Therefore this additional complexity is 
represented by the sum over all the nodes of the dendrogram of operations in We can 

notice that this sum is nothing else than the path length of the hierarchical tree structure of 
community. Classical analysis show that the path length is between ralog(n) and n 2 with an 
average value (over all trees of size n) in 0{n^/n) |18| . 



Function FindMultiscalePartitions (C) 
foreach child C{ of C do 

Qmdx FindMultiscalePartitions (Cj) ; 
end 

if C has no child then 

return a i— > q a (C) 
else 

return a (->• m&x(q a (C), Yji Qmax) 

During the computation, we can keep in memory the communities Ci that are split at each 
scale factor aj. This provides all the necessary information to know at which scale factor a 
each community appears and disappears from the partitions V a . And this makes it possible 
to build the new reorganized dendrogram and all the partitions V a (see Figured). 

If we compare the complexity of this post-processing algorithm to those of the known 
community detection algorithms, we can deduce that it may be integrated after almost any of 
them without changing its overall complexity. Moreover, the hierarchical structures obtained 
from real cases tend to be balanced 0, which is the most favorable case for our complexity. 



3.3 Determining the most relevant scales 

We showed that one can obtain all the best partitions V a for any scale factor a. However all 
these partitions may not have the same relevance in term of community structure. We will 
provide in this section a method to estimate the relevance of these partitions and to retrieve 
the most meaningful scale factors at which clear community structures appear. 

The algorithm of Section 1.121 allows us to know when each community C appears and 
disappears from the partitions V a . Let a m i n (C) and a max (C) be these two scale factors: 
C £ V a for a m i n (C) < a < a max (C). One may consider that the most relevant communities 
will be present for wide ranges of scale factors. We use this to measure the relevance of a 
community C by a max (C) — a m i n (C) and the best scale representing C as a = amax ^ 2 """" ^ . 
These two notions are captured by the following definition. 
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Definition 4. We define the relevance function R a (C) of a community C at scale a by: 

T/iis Zeaefe to the global relevance function R(a) = — \C\R a (C). 

R a (C) is a quadratic function of a. Its maximum is J R( " max(C) ~ am '" (C) ) = a maa; (C) - 

a min (C) and #(ow(C)) = R(a rnax (C)) = Qma " (C) ~ Q """ (C) . It may be used for determining 
the scale factors corresponding to relevant community structures. We can use it to find the 
best scale a which maximizes R(a), but we can also focus on others local maxima of R(a) 
corresponding to others interesting scales. This method allows us to determine several relevant 
scales and thus several relevant partitions (see Figured for an example). 

The computation of R{ct) and its maxima can be done in 0(n). R(a) is a quadratic func- 
tion that can be written as R(a) = Aa 2 + Ba + C between each specific (the aj correspond 
to splits of communities in the hierarchy given by the partitions Va)- At each split, the coef- 
ficients A, B and C are modified according to the coefficients of R a (C) of the corresponding 
communities. The previous algorithm gives us the list of these splits, which allows us to com- 
pute the coefficients A, B and C by updating them at each a^. Each community leads to two 
updates (in constant time) of the coefficients (one when it appears at a max (C) and one when 
it disappears at a m i n (C)), thus the overall complexity is 0(n). 

4 Experimental evaluation 

In this section we evaluate and compare the performances of the different methods and qual- 
ity functions introduced in this paper. Comparing community detection results is a difficult 




Fig. 2. Performance of the different methods measured by the similarity between the partition 
found and the actual generated partition. Left: influence of the size of the graph. Right: 
influence of the modularity of the reference partition. 
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Fig. 3. Influence of the number of communities on generated graphs with n = 1000 vertices. 
Top: similarity between the partition found and the actual generated partition. Bottom: 
modularity Q M of the partition found and of the reference partition. 

task because one needs some test graphs whose community structure is already known. A 
classical approach is to use randomly generated graphs with communities. We will compare 
the partitions obtained by post-processing the results of the same agglomerative algorithm 
|15|14j on a large set of such graphs. 

We generate the test graphs according to the following parameters: the number of vertices 
n, the number of communities c, the average internal and external degrees d{ n and d ou t- We 
divide the n vertices into c equal-sized sets then we draw each possible edge with probabilities 
Pi n or p ou t chosen according to d% n and dout- We evaluate the partition found by comparing 
them to the original generated partition. To achieve this, we use the Rand index corrected 
by Hubert and Arabie |17J10| which evaluates the similarities between two partitions. The 
Rand index I{V\,V2) is the ratio of pairs of vertices correlated by the partitions V\ and V2 
(two vertices are correlated by the partitions V\ and Vi if they are classified in the same 
community or in different communities in the two partitions). The expected value of 7 for a 
random partition is not zero. To avoid this, Hubert and Arabie proposed a corrected index 
that is also more sensitive: V = T 1 Iex T p where I exv is the expected value of 7 for two random 

J-raax J-exp " 

partitions with the same community size as V\ and TV 

We will compare the following approaches: The Classical Modularity (CM) maximizes 
Q M over Vq,...,V c . The Best Modularity (BM) maximizes Q over 77. The Multi-scale 
Modularity (MM) maximizes Q^ 1 over 77 for the most relevant scale factor a given by 72(a). 
Similarly, we define the Best Performance (BP) and the Multi-scale Performance (MP) using 
Q p and the Best Similarity (BS) and the Multi-scale Similarity (MS) using Q s . 

The first test considers a set of 25 000 graphs with different sizes (100 < n < 10000), 
different numbers of communities, different internal degrees 4 < d, m < 10 and different ex- 
ternal degrees, such that the expected modularity of the reference partition satisfies 0.2 < 
Q M {Vref) < 0.6. The results (Figure EJ) show that the performance Q p is not very well suited 
for community detection in sparse networks because it gives too much importance to non- 
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Fig. 4. Detection of communities at two different scales: distance from the macroscopic and 
the microscopic partitions in function of the total internal degree di n = d 1 ™ cro + d™ acro . 

existing edges. We may also notice that the similarity based quality function Q s does not 
produce satisfying results without considering its multi-scale version Q^. And finally we see 
that the classical and the best modularity methods produce good results that are improved 
by the multi-scale approach. 

The two next experiments show the advantages of the multi-scale quality functions. First, 
we will test their ability to find communities at any scale by considering different size of 
communities. We generated a set of graphs with n = 1000 vertices, the same internal degrees 
di n = 3 and the same expected modularity Q^ p = 0.3, but they differ in their number of 
communities 2 < c < 100. The results (Figure |2J) show that multi-scales approaches (MM and 
MS) find the good partition for any number and size of communities while the CM and BM 
approaches have difficulties in finding small communities. 

It is interesting to compare the value of the modularity found by the different approaches. 
Of course the BM method obtains the highest value of modularity, but all the other ap- 
proaches find partitions that are more similar to the reference partition. Moreover, it shows 
that it is possible to find a bad partition (that does not represent the correct scale) with a 
higher modularity than the reference partition. This a disadvantage of the modularity that is 
addressed by the multi-scale modularity proposed in this paper. 

Finally we generated 1000-vertices graphs with two community scales: the vertices are 
divided into 10 communities that are themselves divided into 10 communities. This defines 
a macroscopic and a microscopic partition. The edges are now randomly drawn in order 
to obtain three fixed average degrees d™ cro , d r ™ ,cro and d ou t chosen between 2 and 6. We 
considered the two best scale factors indicated by the relevance function R{a) and compared 
the associated partitions to the two generated partitions. The results (Figure 0J) show that the 
multi-scale quality functions make it possible to find several distinct partitions corresponding 
to different scales. In comparison the BM method, that only find one partition, only detects 
the macroscopic partition. 

5 Conclusion 

We proposed in this paper methods improving the results of any community detection al- 
gorithm finding a hierarchical structure of communityies. First, we showed how to optimize 
additive quality functions over a larger set of partitions than classical approaches. Moreover, 
we proposed multi-scale quality functions that work at different scales and that makes it 
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possible to find more than only one relevant partition. Experiments have shown that these 
methods provide a significant advantage compared to the classical approaches, especially in 
detecting small communities or communities that appear at different scales. 

Moreover, the scale factors associated with each community enable to reorder the dendro- 
gram (Figure ^) , and we are convinced that they could also be integrated in a multi-scale 
visualization tool of complex networks based on community decomposition. 
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